Generalizing the Layering Method of Indyk and Woodruff: Recursive Sketches for Frequency-Based Vectors on Streams
نویسندگان
چکیده
In their ground-breaking paper, Indyk and Woodruff (STOC 05) showed how to compute the k-th frequency moment Fk (for k > 2) in space O(poly-log(n,m) · n1− 2 k ), giving the first optimal result up to poly-logarithmic factors in n and m (here m is the length of the stream and n is the size of the domain.) The method of Indyk and Woodruff reduces the problem of Fk to the problem of computing heavy hitters in the streaming manner. Their reduction only requires polylogarithmic overhead in term of the space complexity and is based on the fundamental idea of “layering.” Since 2005 the method of Indyk and Woodruff has been used in numerous applications and has become a standard tool for streaming computations. We propose a new recursive sketch that generalizes and improves the reduction of Indyk and Woodruff. Our method works for any non-negative frequencybased function in several models, including the insertion-only model, the turnstile model and the sliding window model. For frequency-based functions with sublinear polynomial space complexity our reduction only requires log(n) overhead, where log(n) is the iterative log function. Thus, we improve the reduction of Indyk and Woodruff by polylogarithmic factor. We illustrate the generality of our method by several applications: frequency moments, frequency based functions, spatial data streams and measuring independence of data sets.
منابع مشابه
Recursive Sketching For Frequency Moments
In a ground-breaking paper, Indyk and Woodruff (STOC 05) showed how to compute Fk (for k > 2) in space complexity O(poly-log(n,m) · n 2 k ), which is optimal up to (large) poly-logarithmic factors in n and m, where m is the length of the stream and n is the upper bound on the number of distinct elements in a stream. The best known lower bound for large moments is Ω(log(n)n 2 k ). A follow-up wo...
متن کاملEstimating small frequency moments of data stream: a characteristic function approach
We consider the problem of estimating the first moment of a data stream defined as F1 = ∑ i∈{1,2,...,n}∣fi∣ to within 1± -relative error with high probability. Several algorithms are wellknown for this problem including the median estimator over p-stable sketches by Indyk [11], the geometric means estimator over p-stable sketches by Li [13] and the Hss sketch based algorithm in [8]. The current...
متن کاملروش بردارهای ریتز نوین برای تحلیل دینامیکی سازهها در حوزة فرکانس
Ritz method is one of the techniques for reduction of the degrees of freedom. Efficiency of Ritz method depends on used vectors. The Ritz method uses load depended vectors in spite of modal method and for this reason it is expected to give better results than modal method. This is the advantage of Ritz method. It is worth mention that the mode shapes are independent of loading. The vectors that...
متن کاملOn Estimating Frequency Moments of Data Streams
Space-economical estimation of the pth frequency moments, defined asFp = Pn i=1|fi| , for p > 0, are of interest in estimating all-pairs distances in a large data matrix [14], machine learning, and in data stream computation. Random sketches formed by the inner product of the frequency vector f1, . . . , fn with a suitably chosen random vector were pioneered by Alon, Matias and Szegedy [1], and...
متن کاملIdentical and Nonidentical Synchronization of Hyperchaotic Systems by Active Backstepping Method
This paper focuses on the tracking and synchronization problems of hyperchaotic systems based on active backstepping method. The method consists of a recursive approach that interlaces the choice of a Lyapunov function with the design of feedback control. First, a nonlinear recursive active backstepping control vector is designed to track any desired trajectory in hyperchaotic Wang system. Furt...
متن کامل